Picture for Liang Zheng

Liang Zheng

Gromov Wasserstein Optimal Transport for Semantic Correspondences

Add code
Feb 03, 2026
Viaarxiv icon

Flexible Geometric Guidance for Probabilistic Human Pose Estimation with Diffusion Models

Add code
Feb 03, 2026
Viaarxiv icon

Mosaic: Unlocking Long-Context Inference for Diffusion LLMs via Global Memory Planning and Dynamic Peak Taming

Add code
Jan 10, 2026
Viaarxiv icon

InpaintHuman: Reconstructing Occluded Humans with Multi-Scale UV Mapping and Identity-Preserving Diffusion Inpainting

Add code
Jan 05, 2026
Viaarxiv icon

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

Add code
Dec 28, 2025
Viaarxiv icon

What matters for Representation Alignment: Global Information or Spatial Structure?

Add code
Dec 11, 2025
Viaarxiv icon

Effective Training Data Synthesis for Improving MLLM Chart Understanding

Add code
Aug 08, 2025
Viaarxiv icon

Vec2Face+ for Face Dataset Generation

Add code
Jul 23, 2025
Viaarxiv icon

DiSA: Diffusion Step Annealing in Autoregressive Image Generation

Add code
May 26, 2025
Viaarxiv icon

REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers

Add code
Apr 14, 2025
Viaarxiv icon